34 research outputs found
Finding regulatory DNA motifs using alignment-free evolutionary conservation information
As an increasing number of eukaryotic genomes are being sequenced, comparative studies aimed at detecting regulatory elements in intergenic sequences are becoming more prevalent. Most comparative methods for transcription factor (TF) binding site discovery make use of global or local alignments of orthologous regulatory regions to assess whether a particular DNA site is conserved across related organisms, and thus more likely to be functional. Since binding sites are usually short, sometimes degenerate, and often independent of orientation, alignment algorithms may not align them correctly. Here, we present a novel, alignment-free approach for using conservation information for TF binding site discovery. We relax the definition of conserved sites: we consider a DNA site within a regulatory region to be conserved in an orthologous sequence if it occurs anywhere in that sequence, irrespective of orientation. We use this definition to derive informative priors over DNA sequence positions, and incorporate these priors into a Gibbs sampling algorithm for motif discovery. Our approach is simple and fast. It requires neither sequence alignments nor the phylogenetic relationships between the orthologous sequences, yet it is more effective on real biological data than methods that do
A Nucleosome-Guided Map of Transcription Factor Binding Sites in Yeast
Finding functional DNA binding sites of transcription factors (TFs) throughout the genome is a crucial step in understanding transcriptional regulation. Unfortunately, these binding sites are typically short and degenerate, posing a significant statistical challenge: many more matches to known TF motifs occur in the genome than are actually functional. However, information about chromatin structure may help to identify the functional sites. In particular, it has been shown that active regulatory regions are usually depleted of nucleosomes, thereby enabling TFs to bind DNA in those regions. Here, we describe a novel motif discovery algorithm that employs an informative prior over DNA sequence positions based on a discriminative view of nucleosome occupancy. When a Gibbs sampling algorithm is applied to yeast sequence-sets identified by ChIP-chip, the correct motif is found in 52% more cases with our informative prior than with the commonly used uniform prior. This is the first demonstration that nucleosome occupancy information can be used to improve motif discovery. The improvement is dramatic, even though we are using only a statistical model to predict nucleosome occupancy; we expect our results to improve further as high-resolution genome-wide experimental nucleosome occupancy data becomes increasingly available
Recommended from our members
The vitamin D receptor gene as a determinant of survival in pancreatic cancer patients: Genomic analysis and experimental validation
Purpose: Advanced pancreatic cancer is a highly refractory disease almost always associated with survival of little more than a year. New interventions based on novel targets are needed. We aim to identify new genetic determinants of overall survival (OS) in patients after treatment with gemcitabine using genome-wide screens of germline DNA. We aim also to support these findings with in vitro functional analysis. Patients and methods: Genome-wide screens of germline DNA in two independent cohorts of pancreatic cancer patients (from the Cancer and Leukemia Group B (CALGB) 80303 and the Mayo Clinic) were used to select new genes associated with OS. The vitamin D receptor gene (VDR) was selected, and the interactions of genetic variation in VDR with circulating vitamin D levels and gemcitabine treatment were evaluated. Functional effects of common VDR variants were also evaluated in experimental assays in human cell lines. Results: The rs2853564 variant in VDR was associated with OS in patients from both the Mayo Clinic (HR 0.81, 95% CI 0.70–0.94, p = 0.0059) and CALGB 80303 (HR 0.74, 0.63–0.87, p = 0.0002). rs2853564 interacted with high pre-treatment levels of 25-hydroxyvitamin D (25(OH)D, a measure of endogenous vitamin D) (p = 0.0079 for interaction) and with gemcitabine treatment (p = 0.024 for interaction) to confer increased OS. rs2853564 increased transcriptional activity in luciferase assays and reduced the binding of the IRF4 transcription factor. Conclusion: Our findings propose VDR as a novel determinant of survival in advanced pancreatic cancer patients. Common functional variation in this gene might interact with endogenous vitamin D and gemcitabine treatment to determine improved patient survival. These results support evidence for a modulatory role of the vitamin D pathway for the survival of advanced pancreatic cancer patients.</p